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DETAILED ACTION 

Response to Amendment 

1 . In response to the Office Action mailed 9/24/07, Applicants have submitted an 
Amendment, filed 12/26/07, amending claims 1-5, adding new claims 7-9, without adding new 
matter, and arguing to traverse claim rejections. 

Response to Arguments 

2. Applicant's arguments with respect to claims 1-9 have been considered but are moot in 
view of the new ground(s) of rejection, below. 

Specification 

3. The abstract and disclosure have been amended and these changes are acceptable. Thus, 
the objections have been withdrawn. 

Claim Objections 

4. Claims 1-3 have been amended and these changes are acceptable. Thus, the objections 
have been withdrawn. 

5. Claims 1 , 3 and 5 are objected to because of the following informalities: Line 1 1 of 
claims 1 and 3, and line 10 of claim 5, recite "coefficient l,f which should be -coefficient y t j~. 
Appropriate correction is required. 

Claim Rejections - 35 USC § 112 

6. Claims 1, 4 and 5 have been amended and these changes are acceptable. Thus, the 
rejections have been withdrawn. 
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Information Disclosure Statement 

7. The information disclosure statement (IDS) submitted on 12/26/07 was filed after the 
mailing date of the Non-Final Rejection on 9/24/07. The submission is in compliance with the 
provisions of 37 CFR 1 .97. Accordingly, the information disclosure statement is being 
considered by the examiner. 

Claim Rejections - 35 USC § 103 

8. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 

obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in section 
102 of this title, if the differences between the subject matter sought to be patented and the prior art are such that the 
subject matter as a whole would have been obvious at the time the invention was made to a person having ordinary 
skill in the art to which said subject matter pertains. Patentability shall not be negatived by the manner in which the 
invention was made. 

9. Claims 1, 3-5 and 7-9 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Agarwal et al. (hereinafter "Agarwal"), "Two-Stage Mel-Warped Wiener Filter for Robust 
Speech Recognition" (published 1999) in view of Bayya et al. (hereinafter "Bayya"), US Patent 
6,446,038, and further in view of Bernard et al. (hereinafter "Bernard"), "Low-Bitrate 
Distributed Speech Recognition for Packet-Based and Wireless Communication" (published 
2000). 

Regarding claims 1 and 3 , Agarwal teaches a method for performing time and frequency 
SNR dependent weighting in speech recognition comprising the steps of: for each speech frame 
or time period t, estimating the SNR to get time and frequency SNR information rj t j and 
calculating the time and frequency weighting to get weighting coefficient y t j wherein y t j is a 
function of r\ t j (Section 2.1, Formulation of Mel- Warped Wiener Filter; Section 2.2, "Mel- 
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warped Wiener filter requires an estimate of the noise power spectrum. . .a precise estimate of 
noise is essential to insure the algorithm performance"; Fig. 2, frequency domain Wiener filter, a 
Weiner filter filters out noise that has corrupted a signal; the Weiner filter weights frequencies, 
and since Agarwal teaches a time -varying Weiner filter, Agarwal also teaches weighting in 
time). 

Agarwal teaches in section 2.3 and Fig. 2, "frequencies of the mel-warped discrete cosine 
transform. . .transfer function of the Wiener filter in the frequency domain is then 
constructed. . .noise spectrum is estimated. . .subtracted from the power spectrum of the noisy 
signal... inverse Mel-DCT is computed to obtain the filter in the time domain." Agarwal does 
not teach spectral and cepstral distances, but Agarwal in combination with Bayya teaches: using 
an inverse DCT matrix M" 1 to transform a cepstral distance (o t -u) associated with the speech 
frame t, to a spectral distance; computing a weighted spectral distance by applying time and 
frequency weighting to the spectral distance employing a time-varying diagonal matrix G t which 
represents the weighting coefficient y t f, and transforming the weighted spectral distance to a 
weighted cepstral distance employing a forward DCT matrix M to get a transformation matrix T, 
(Bayya teaches in col. 3, 11. 1-8, "performing spectral analysis in different domains. ..speech 
samples may be analyzed according to. . .spectral analysis techniques. . .the cepstral coefficient 
vectors are used as features"; col. 3, 11. 30 - col. 4, line 32, "speech samples. . .transformed into 
an appropriate domain. . .2) Log spectral distance... 6) LPC and PLP (Perceptual Linear 
Prediction) cepstral distances. . .cepstral coefficients. . .speech frames transformed into various 
domains"). 
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It would have been obvious for one of ordinary skill in the art at the time the invention 
was made to combine the teaching elements of Agarwal with Bayya because Bayya teaches the 
"weighted cepstral distance measure is quite effective in equalizing the performance of the 
recognizer across different talkers" (p. 1419, end of 3 rd paragraph). 

Agarwal teaches providing the transformation matrix T t and the original MFCC feature o t 
that contains the information about the SNR to a recognizer (bottom of Section 2.3, "the power 
spectral density is then used to directly calculate the cepstrum for further processing in speech 
recognition systems"; section 3, experimental results, "mel-frequency cepstral coefficients"). 

Agarwal does not explicitly teach, a recognizer including Viterbi decoding and 
performing weighted Viterbi recognition bj(o t ). However, weighted Viterbi recognition/Viterbi 
decoding is old and well known in the art as evidenced by Bernard in Section V. Weighted 
Viterbi Recognition (WVR), pp. 575-576. It would have been obvious for one of ordinary skill 
in the art at the time the invention was made to perform weighted Viterbi recognition because it 
preserves synchronization of the Viterbi algorithm and significantly reduces word error rate, as 
indicated by Bernard on p. 577, lines 7-9 of Section C. 

Regarding claim 7 , the limitations of claim 7 are the same as or similar to those of claims 
1 and 3, rejected above, and thus are rejected for the same reasons. 

Regarding claims 4 and 8 , Agarwal suggests wherein the estimating the SNR to get time 
and frequency SNR information rj u f is a pronunciation probability estimation (Section 3.2 
teaches experiments with the AURORA database). 

Regarding claims 5 and 9 , Agarwal suggests wherein the estimating the SNR to get time 
and frequency SNR information tj t j is a transmission over a noisy communication channel 
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reliability estimation (Section 1, 11. 1-4, teaches "speech recognition in an automobile noise 
environment, where colored noise with intensity as high as or even higher than the input 
speech"). 

10. Claims 2 and 6 are rejected under 35 U.S.C. 103(a) as being unpatentable over Agarwal 
et al. in view of Bayya et al. and Bernard et a!. , and further in view of Pastor et al. , US Patent 



Agarwal teaches Wiener filtering (pp. 1-2), which suggests wherein ' ' ^ ' " , 

which guarantees that y,/is equal to 0 when r],j=0 and y t j approaches 1 when r\ t j is large, the 
equation y t j being a Wiener- type filter. It would have been obvious for one of ordinary skill in 
the art at the time the invention was made to use a Wiener-type filter because Wiener filtering 
enables the separation of the signals by decorrelation. Its importance is related to the simplicity 
of the theoretical computations. Furthermore, it can be applied to a multitude of particular 
processes such as the removal of a noise that is polluting a speech signal, as indicated by Pastor 
et al, in col. 1, line 66 - col. 2, line 5. The equation y t j is just an alternative method of 
performing weighting for emphasizing the signal or the noise. 



6,445,801. 
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Conclusion 

1 1 . The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure: Caceres et al. (US Patent 6,167,133) teaches echo detection, tracking, cancellation 
and noise fill in real time in a communication system. 

Bruckner et al. (US Patent 6,678,657) teaches a method and apparatus for a robust feature 
extraction for speech recognition. 

Hermansky et al. (US Patent 5,450,522) teaches an auditory model for parameterization 
of speech. 

Boll et al. (US Patent 4,897,878) teaches noise compensation in a speech recognition 
apparatus. 

Tohkura teaches "A Weighted Cepstral Distance Measure for Speech Recognition." 

12. Applicant's amendment necessitated the new ground(s) of rejection presented in this 
Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). 
Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1 .136(a) will be calculated from the mailing date of the advisory action. In no event, 
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however, will the statutory period for reply expire later than SIX MONTHS from the date of this 
final action. 

1 3 . Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Eunice Ng whose telephone number is 571-272-2854. The 
examiner can normally be reached on Monday through Friday, 8:30 a.m. - 5:00 p.m. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, David Hudspeth can be reached on 571-272-7843. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

IE. N./ 

Examiner, Art Unit 2626 



/David R Hudspeth/ 

Supervisory Patent Examiner, Art Unit 2626 



